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RAW SEQUENCE LISTING DATE: 10/04/2001 

PATENT APPLICATION: US/09/866,307 * TIME: 17:40:34 

Input Set : A:\sequence.txt 

Output Set: N:\CRF3\10042001\I866307.raw 

SEQUENCE LISTING 

3 (1) GENERAL INFORMATION: 

4 (i) APPLICANT: LAWRENCE P. WACKETT 

5 (ii) TITLE OF INVENTION: DNA MOLECULES AND PROTEIN DISPLAYING 

6 IMPROVED TRIAZINE COMPOUND DEGRADING ABILITY 

7 (iii) NUMBER OF SEQUENCES: 26 

8 (iv) CORRESPONDENCE ADDRESS: 

9 (A) ADDRESSEE: MUETING ; RAASCH & GEBHARDT , P. A. 

10 (B) STREET: 119 North Fourth Street 

11 (C) CITY: Minneapolis 

12 (D) STATE: Minnesota 

13 (E) COUNTRY: USA 

14 (F) ZIP: 55401 

15 (v) COMPUTER READABLE FORM: 

16 (A) MEDIUM TYPE: Floppy disk 

17 (B) COMPUTER: IBM PC compatible 

18 (C) OPERATING SYSTEM: PC -DOS/MS -DOS 

19 (D) SOFTWARE: Patentln Release #1.0, Version #1.30 

20 (vi) CURRENT APPLICATION DATA: 

C--> 21 (A) APPLICATION NUMBER: US/09/866,307 

C--> 22 (B) FILING DATE: 20-Sep-2001 

23 (C) CLASSIFICATION: 

24 (vii) PRIOR APPLICATION DATA: 

25 (A) APPLICATION NUMBER: 60/035,404 

26 (B) FILING DATE: 17-JAN-1997 

27 (A) APPLICATION NUMBER: PCT/US98/00944 

28 (B) FILING DATE: 16-JAN-1998 

29 (A) APPLICATION NUMBER: 09/155,036 

30 (B) FILING DATE: 17-SEPT-1998 

31 (viii) ATTORNEY/AGENT INFORMATION: 

32 (A) NAME: PROVENCE, DAVID L. / 

33 (B) REGISTRATION NUMBER: 43,022 

34 (C) REFERENCE/DOCKET NUMBER: 110.00440102 

35 (ix) TELECOMMUNICATION INFORMATION: 

36 (A) TELEPHONE: 612-305-1005 

37 (B) TELEFAX: 612-305-1228 
39 (2) INFORMATION FOR SEQ ID NO: 1: 

41 (i) SEQUENCE CHARACTERISTICS: 

42 (A) LENGTH: 1858 base pairs 

43 (B) TYPE: nucleic acid 

44 (C) STRANDEDNESS : single 

45 (D) TOPOLOGY: linear 

47 (ii) MOLECULE TYPE: DNA (genomic) 

50 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

52 CTCGGGTAAC TTCTTGAGCG CGGCCACAGC AGCCTTGATC ATGAAGGCGA GCATGGTGAC 60 
54 CTTGACGCCG CTCTTTTCGT TCTCTTTGTT GAACTGCACG CGAAAGGCTT CCAGGTCGGT 120 
56 GATGTCCGCG TCGTCGTGGT TGGTGACGTG CGGGATGACC ACCCAGTTGC GGTGCAGGTT 180 
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RAW SEQUENCE LISTING DATE: 10/04/2001 

PATENT APPLICATION: US/09/866,307 TIME: 17:40:34 



Input Set : A:\sequence.txt 

Output Set: N:\CRF3\10042001\l866307.raw 

58 TTTCGATGGC ATAATATCTG CGTTGCGACG TGTAACACAC TATTGGAGAC ATATCATGCA 240 

60 AACGCTCAGC ATCCAGCACG GTACCCTCGT CACGATGGAT CAGTACCGCA GAGTCCTTGG 300 

62 GGATAGCTGG GTTCACGTGC AGGATGGACG GATCGTCGCG CTCGGAGTGC ACGCCGAGTC 360 

64 GGTGCCTCCG CCAGCGGATC GGGTGATCGA TGCACGCGGC AAGGTCGTGT TACCCGGTTT 420 

66 CATCAATGCC CACACCCATG TGAACCAGAT CCTCCTGCGC GGAGGGCCCT CGCACGGACG 480 

68 TCAATTCTAT GACTGGCTGT TCAACGTTGT GTATCCGGGA CAAAAGGCGA TGAGACCGGA 540 

70 GGACGTAGCG GTGGCGGTGA GGTTGTATTG TGCGGAAGCT GTGCGCAGCG GGATTACGAC 600 

72 GATCAACGAA AACGCCGATT CGGCCATCTA CCCAGGCAAC ATCGAGGCCG CGATGGCGGT 660 

74 CTATGGTGAG GTGGGTGTGA GGGTCGTCTA CGCCCGCATG TTCTTTGATC GGATGGACGG 720 

76 GCGCATTCAA GGGTATGTGG ACGCCTTGAA GGCTCGCTCT GCCCAAGTCG AACTGTGCTC 780 

78 GATCATGGAG GAAACGGCTG TGGCCAAAGA TCGGATCACA GCCCTGTCAG ATCAGTATCA 840 

80 TGGCACGGCA GGAGGTCGTA TATCAGTTTG GCCCGCTCCT GCCACTACCA CGGCGGTGAC 900 

82 AGTTGAAGGA ATGCGATGGG CACAAGCCTT CGCCCGTGAT CGGGCGGTAA TGTGGACGCT 960 

84 TCACATGGCG GAGAGCGATC ATGATGAGCG GATTCATGGG ATGAGTCCCG CCGAGTACAT 1020 

86 GGAGTGTTAC GGACTCTTGG ATGAGCGTCT GCAGGTCGCG CATTGCGTGT ACTTTGACCG 1080 

88 GAAGGATGTT CGGCTGCTGC ACCGCCACAA TGTGAAGGTC GCGTCGCAGG TTGTGAGCAA 1140 

90 TGCCTACCTC GGCTCAGGGG TGGCCCCCGT GCCAGAGATG GTGGAGCGCG GCATGGCCGT 1200 

92 GGGCATTGGA ACAGATAACG GGAATAGTAA TGACTCCGCA AACATGATCG GAGACATGAA 1260 

94 GTTTATGGCC CATATTCACC GCGCGGTGCA TCGGGATGCG GACGTGCTGA CCCCAGAGAA 1320 

96 GATTCTTGAA ATGGCGACGA TCGATGGGGC GCGTTCGTTG GGAATGGACC ACGAGATTGG 1380 

98 TTCCATCGAA ACCGGCAAGC GCGCGGACCT TATCCTGCTT GACCTGCGTC ACCTCAGACG 1440 

100 ACTCTCACAT CATTTGGCGG CCACGATCGT GTTTCAGGCT TACGGCAATG AGGTGGACAC 1500 

102 TGTCCTGATT GACGGAAACG TTGTGATGGA GAACCGCCGC TTGAGCTTTC TTCCCCCTGA 1560 

104 ACGTGAGTTG GCGTTCCTTG AGGAAGCGCA GAGCCGCGCC ACAGCTATTT TGCAGCGGGC 1620 

106 GAACATGGTG GCTAACCCAG CTTGGCGCAG CCTCTAGGAA ATGACGCCGT TGCTGCATCC 1680 

108 GCCGCCCCTT GAGGAAATCG CTGCCATCTT GGCGCGGCTC GGATTGGGGG GCGGACATGA 1740 

110 CCTTGATGGA TACAGAATTG CCATGAATGC GGCACTTCCG TCCTTCGCTC GTGTGGAATC 1800 

112 GTTGGTAGGT GAGGGTCGAC TGCGGGCGCC AGCTTCCCGA AGAGGTGAAA GGCCCGAG 1858 
114 (2) INFORMATION FOR SEQ ID NO: 2: 

116 (i) SEQUENCE CHARACTERISTICS: 

117 (A) LENGTH: 473 amino acids 



122 (ii) MOLECULE TYPE: protein 

127 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

129 Met Gin Thr Leu Ser lie Gin His Gly Thr Leu Val Thr Met Asp Gin 

130 1 5 10 15 

132 Tyr Arg Arg Val Leu Gly Asp Ser Trp Val His Val Gin Asp Gly Arg 

133 20 25 30 

135 He Val Ala Leu Gly Val His Ala Glu Ser Val Pro Pro Pro Ala Asp 

136 35 40 45 

138 Arg Val lie Asp Ala Arg Gly Lys Val Val Leu Pro Gly Phe He Asn 

139 50 55 60 

141 Ala His Thr His Val Asn Gin He Leu Leu Arg Gly Gly Pro Ser His 

142 65 70 75 80 

144 Gly Arg Gin Phe Tyr Asp Trp Leu Phe Asn Val Val Tyr Pro Gly Gin 

145 85 90 95 

147 Lys Ala Met Arg Pro Glu Asp Val Ala Val Ala Val Arg Leu Tyr Cys 
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219 (2) INFORMATION FOR SEQ ID NO: 3: 
221 (i) SEQUENCE CHARACTERISTICS: 
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RAW SEQUENCE LISTING DATE: 10/04/2001 

PATENT APPLICATION: US/09/866,307 TIME: 17:40:34 

Input Set : A:\sequence.txt 

Output Set: N:\CRF3\10042001\l866307.raw 

222 (A) LENGTH: 1808 base pairs 

223 (B) TYPE: nucleic acid 

224 (C) STRANDEDNESS : single 

225 (D) TOPOLOGY: linear 

227 (ii) MOLECULE TYPE: DNA (genomic) 

232 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

234 GCGAGCATGG TGACCTTGAC GCCGCTCTTT TCGTTCTCTT TGTTGAACTG CACGCGAAAG 60 

236 GCTTCCAGGT CGGTGATGTC CGCGTCGTCG TGGTTGGTGA CGTGCGGGAT GACCACCCAG 120 

238 TTGCGGTGCA GGTTTTTCGA • TGGCATAATA TCTGCGTTGC GACGTGTAAC ACACTATTGG 180 

240 AGACATATCA TGCAAACGCT CAGCATCCAG CACGGTACCC TCGTCACGAT GGATCAGTAC 240 

242 CGCAGAGTCC TTGGGGATAG CTGGGTTCAC GTGCAGGATG GACGGATCGT CGCGCTCGGA 300 

244 GTGCACGCCG AGTCGGTGCC TCCGCCAGCG GATCGGGTGA TCGATGCACG CGGCAAGGTC 360 

246 GTGTTACCCG GTTTCATCAA TGCCCACACC CATGTGAACC AGATCCTCCT GCGCGGAGGG 420 

248 CCCTCGCACG GGCGTCAATT CTATGACTGG CTGTTCAACG TTGTGTATCC GGGACAAAAG 480 

250 GCGATGAGAC CGGAGGACGT AGCGGTGGCG GTGAGGTTGT ATTGTGCGGA AGCTGTGCGC 540 

252 AGCGGGATTA CGACGATCAA CGAAAACGCC GATTCGGCCA TCTACCCAGG CAACATCGAG 600 

254 GCCGCGATGG CGGTCTATGG TGAGGTGGGT GTGAGGGTCG TCTACGCCCG CATGTTCTTT 660 

256 GATCGGATGG ACGGGCGCAT TCAAGGGTAT GTGGACGCCT TGAAGGCTCG CTCTCCCCAA 720 

258 GTCGAACTGT GCTCGATCAT GGAGGGAACG GCTGTGGCCA AAGATCGGAT CACAGCCCTG 780 

260 TCAGATCAGT ATCATGGCAC GGCAGGAGGT CGTATATCAG TTTGGCCCGC TCCTGCCACT 840 

262 ACCACGGCGG TGACAGTTGA AGGAATGCGA TGGGCACAAG CCTTCGCCCG TGATCGGGCG 900 

264 GTAATGTGGA CGCTTCACAT GGCGGAGAGC GATCATGATG AGCGGATTCA TGGGATGAGT 960 

266 CCCGCCGAGT ACATGGAGTG TTACGGACTC TTGGATGAGC GTCTGCAGGT CGCGCATTGC 1020 

268 GTGTACTTTG ACCGGAAGGA TGTTCGGCTG CTGCACCGCC ACAATGTGAA GGTCGCGTCG 1080 

270 CAGGTTGTGA GCAATGCCTA CCTCGGCTCA GGGGTGGCCC CCGTGCCAGA GATGGTGGAG 1140 

272 CGCGGCATGG CCGTGGGCAT TGGAACAGAT AACGGGAATA GTAATGACTC CGTAAACATG 1200 

274 ATCGGAGACA TGAAGTTTAT GGCCCATATT CACCGCGCGG TGCATCGGGA TGCGGACGTG 1260 

276 CTGACCCCAG AGAAGATTCT TGAAATGGCG ACGATCGATG GGGCGCGTTC GTTGGGAATG 1320 

278 GACCACGAGA TTGGTTCCAT CGAAACCGGC AAGCGCGCGG ACCTTATCCT GCTTGACCTG 1380 

280 CGTCACCCTC AGACGACTCC TCACCATCAT TTGGCGGCCA CGATCGTGTT TCAGGCTTAC 1440 

282 GGCAATGAGG TGGACACTGT CCTGATTGAC GGAAACGTTG TGATGGAGAA CCGCCGCTTG 1500 

284 AGCTTTCTTC CCCCTGAACG TGAGTTGGCG TTCCTTGAGG AAGCGCAGAG CCGCGCCACA 1560 

286 GCTATTTTGC AGCGGGCGAA CATGGTGGCT AACCCAGCTT GGCGCAGCCT CTAGGAAATG 1620 

288 ACGCCGTTGC TGCATCCGCC GCCCCTTGAG GAAATCGCTG CCATCTTGGC GCGGCTCGGA 1680 

290 TTGGGGGGCG GACATGACCT TGATGGATAC AGAATTGCCA TGAATGCGGC ACTTCCGTCC 1740 

292 TTCGCTCGTG TGGAATCGTT GGTAGGTGAG GGTCGACTGC GGGCGCCAGC TTCCCGAAGA 1800 

294 AGTGAAAG 1808 
296 (2) INFORMATION FOR SEQ ID NO: 4: 

298 (i) SEQUENCE CHARACTERISTICS: 

299 (A) LENGTH: 1846 base pairs 

300 (B) TYPE: nucleic acid 

301 <C) STRANDEDNESS: single 

302 (D) TOPOLOGY: linear 

304 (ii) MOLECULE TYPE: DNA (genomic) 

309 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

311 GAGCGCCGCC ACAGCAGCCT TGATCATGAA GGCGAGCATG GTGACCTTGA CGCCGCTCTT 60 

313 TTCGTTCTCT TTGTTGAACT GCACGCGAAA GGCTTCCAGG TCGGTGATGT CCGCGTCGTC 120 

315 GTGGTTGGTG ACGTGCGGGA TGACCACCCA GTTGCGGTGC AGGTTTTTCG ATGGCGTAAT 180 

317 ATCTGCGTTG CGACGTGTAA CACACTATTG GAGACATATC ATGCAAACGC TCAGCATCCA 240 
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RAW SEQUENCE LISTING DATE: 10/04/2001 

PATENT APPLICATION: US/09/866 r 307 TIME: 17:40:34 



Input Set ; A:\sequence.txt 

Output Set: N:\CRF3\10042001\l866307.raw 

319 GCACGGTACC CTCGTCACGA TGGATCAGTA CCGCAGAGTC CTTGGGGATA GCTGGGTTCA 300 
321 CGTGCAGGAT GGACGGATCG TCGCGCTCGG AGTGCACGCC GAGTCGGTGC CTCCGCCAGC " 360 

323 GGATCGGGTG ATCGATGCAC GCGGCAAGGT CGTGTTACCC GGTTTCATCA ATGCCCACAC 420 

325 CCATGTGAAC CAGATCCTCC TGCGCGGAGG GCCCTCGCAC GGGCGTCAAT TCTATGACTG 4 80 

327 GCTGTTCAAC GTTGTGTATC CGGGACAAAA GGCGATGAGA CCGGAGGACG TAGCGGTGGC 540 

329 GGTGAGGTTG TATTGTGCGG AAGCTGTGCG CAGCGGGATT ACGACGATCA ACGAAAACGC 600 

331 CGATTCGGCC ATCTACCCAG GCAACATCGA GGCCGCGATG GCGGTCTATG GTGAGGTGGG 660 

333 TGTGAGGGTC GTCTACGCCC GCATGTTCTT TGATCGGATG GACGGGCGCA TTCAAGGGTA 720 

335 TGTGGACGCC TTGAAGGCTC GCTCTCCCCA AGTCGAACTG TGCTCGATCA TGGAGGAAAC 780 

337 GGCTGTGGCC AAAGATCGGA TCACAGCCCT GTCAGATCAG TATCATGGCA CGGCAGGAGG 840 

339 TCGTATATCA GTTTGGCCCG CTCCTGCCAC TACCACGGCG GTGACAGTTG AAGGAATGCG 900 

341 ATGGGCACAA GCCTTCGCCC GTGATCGGGC GGTAATGTGG ACGCTTCACA TGGCGGAGAG 960 

34 3 CGATCATGAT GAGCGGATTC ATGGGATGAG TCCCGCCGAT TACATGGAGT GTTACGGACT 1020 

345 CTTGGATGAG CGTCTGCAGG TCGCGCATTG CGTGTACTTT GACCGGAAGG ATGTTCGGCT 1080 

347 GCTGCACCGC CACAATGTGA AGGTCGCGTC GCAGGTTGTG AGCAATGCCT ACCTCGGCTC 1140 

349 AGGGGTGGCC CCCGTGCCAG AGATGGTGGA GCGCGGCATG GCCGTGGGCA TTGGAACAGA 1200 

351 TAACGGGAAT AGTAATGACT CCGTAAACAT GATCGGAGAC ATGAAGTTTA TGGCCCATAT 1260 

353 TCACCGCGCG GTGCATCGGG ATGCGGACGT GCTGACCCCA GAGAAGATTC TTGAAATGGC 1320 

355 GACGATCGAT GGGGCGCGTT CGTTGGGGAT GGACCACGAG ATTGGTTCCA TCGAAACCGG 1380 

357 CAAGCGCGCG GACCTTATCC TGCTTGACCT GCGTCACCCT CAGACGACTC CTCACCATCA 1440 

359 TTTGGCGGCC ACGATCGTGT TTCAGGCTTA CGGCAATGAG GTGGACACTG TCCTGATTGA 1500 

361 CGGAAACGTT GTGATGGAGA ACCGCCGCTT GAGCTTTCTT CCCCCTGAAC GTGAGTTGGC 1560 

363 GTTCCTTGAG GAAGCGCAGA GCCGCGCCAC AGCTATTTTG CAGCGGGCGA ACATGGTGGC 1620 

365 TAACCCAGCT TGGCGCAGCC TCTAGGAAAT GACGCCGTTG CTGCATCCGC CGCCCCTTGA 1680 

367 GGAAATCGCT GCCATCTTGG CGCGGCTCGG ATTGGGGGGC GGACATGACC TTGATGGATA 1740 

369 CAGAATTGCC ATGAATGCGG CACTTCCGTC CTTCGCTCGT GTGGAATCGT TGGTAGGTGA 1800 

371 GGGTCGACTG CGGGCGCCAG CTTCCCGAAG AAGTGAAAGG CCCGAG 1846 
373 (2) INFORMATION FOR SEQ ID NO: 5: 

375 (i) SEQUENCE CHARACTERISTICS: 

376 (A) LENGTH: 601 amino acids 



381 (ii) MOLECULE TYPE: protein 

386 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

388 Ala Ser Met Val Thr Leu Thr Pro Leu Phe Ser Phe Ser Leu Leu Asn 

389 1 5 10 15 

391 Cys Thr Arg Lys Ala Ser Arg Ser Val Met Ser Ala Ser Ser Trp Leu 

392 20 25 30 

394 Val Thr Cys Gly Met Thr Thr Gin Leu Arg Cys Arg Phe Phe Asp Gly 

395 35 40 45 

397 lie lie Ser Ala Leu Arg Arg Val Thr His Tyr Trp Arg His He Met 

398 50 55 60 

400 Gin Thr Leu Ser He Gin His Gly Thr Leu Val Thr Met Asp Gin Tyr 

401 65 70 75 80 

403 Arg Arg Val Leu Gly Asp Ser Trp Val His Val Gin Asp Gly Arg He 

404 85 90 95 

406 Val Ala Leu Gly Val His Ala Glu Ser Val Pro Pro Pro Ala Asp Arg 

407 100 105 110 
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